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Abstract 

Background: Zymoseptoria tritici is a hemibiotrophic ascomycete fungus causing leaf blotch of wheat that often 
decreases yield severely. Populations of the fungus are known to be highly diverse and poorly differentiated from 
each other. However, a genotyping tool is needed to address further questions in large collections of isolates, 
regarding regional population structure, adaptation to anthropogenic selective pressures, and dynamics of the 
recently discovered accessory chromosomes. This procedure is limited by costly and time-consuming simplex PCR 
genotyping. Recent development of genomic approaches and of larger sets of SSRs enabled the optimization of 
microsatellite multiplexing. 

Findings: We report here a reliable protocol to amplify 24 SSRs organized in three multiplex panels, and covering 
all Z. tritici chromosomes. We also propose an automatic allele assignment procedure, which allows scoring alleles 
in a repeatable manner across studies and laboratories. All together, these tools enabled us to characterize local 
and worldwide populations and to calculate diversity indexes consistent with results reported in the literature. 

Conclusion: This easy-to-use, accurate, repeatable, economical, and faster technical strategy can provide useful 
genetic information for evolutionary inferences concerning Z tritici populations. Moreover, it will facilitate the 
comparison of studies from different scientific groups. 

Keywords: Microsatellite, Single sequence repeat (SSR), Genotyping, Multiplex, Population genetics, Diversity, 
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Background 

Zymoseptoria tritici (syn. Mycosphaerella graminicola, ana- 
morph Septoria tritici) [1] is an ascomycete fungus that 
causes Septoria leaf blotch, the most important foliar dis- 
ease of winter wheat in Northern Europe. Moreover, this 
species has adapted to fungicides (especially from the azole 
and strobilurin families), which have been used intensively 
since the 1970s [2,3]. Intensive research has been under- 
taken to understand the biology of this fungus and the 
mechanisms of its pathogenicity, with the objective to de- 
velop durable management tools [4], The Z tritici genome 
has been fully sequenced and annotated, which revealed a 
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high chromosome plasticity [5]. The sequenced genome 
contained 21 chromosomes, eight of them being faculta- 
tively gained or lost during meiosis in some isolates and 
constituting the dispensome, which accounts for up to ap- 
proximately 12% of the genome size [5,6]. It was recently 
demonstrated that these accessory chromosomes origi- 
nated mainly from ancient core chromosomes through a 
degeneration process that included breakage-fusion-bridge 
cycles, nondisjunction and mutational decay of duplicated 
sequences [7]. 

Z. tritici populations have been shown to be highly di- 
verse on scales going from leaf to continents [8]. Whole- 
sale gene flow and frequent recombination are supposed 
to occur at a global level [9]. Nevertheless, population 
differentiation has been detected in Iran, western USA, 
Germany and France [9-13], i.e. several of these areas 
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were not included in the previous worldwide surveys. 
Additional large collections of isolates need to be ana- 
lyzed to confirm and refine the population differenti- 
ation observed in those specific areas, and to identify the 
factors that influence the differentiation (for example, 
anthropogenic selection pressure or host/cultivar adap- 
tation; seasonal adaptation; history of geographic spreads). 
Those questions should be answered in parallel on the 
so-called core genome (chromosomes 1 to 13) and the 
accessory (or dispensable) genome (chromosomes 14 
to 21), which are characterized by strikingly different 
organizations [6,7] and rates of evolution [14] likely to 
result in diverging dynamics between the two compart- 
ments. Such a compartmentalized genome has been 
termed a "2-speed genome" [7]. 

Several marker sets are available to study the genetic 
structure of Z tritici populations, ranging from sequence 
of mating- type genes [15-17], random amplified poly- 
morphic DNA (RAPD) [18], restriction fragment length 
polymorphism (RFLP) [8] to amplified fragment length 
polymorphism (AFLP) [11,19]. A limited set of microsat- 
ellite/simple sequence repeat (SSR) markers was devel- 
oped in the late 1990s [20] and used for field studies 
[10,21]. It was later complemented by a larger set of 99 
new SSRs, derived from an EST database [22], and used 
at least in one study [23]. SSRs are of particular value 
for population studies, since they are co-dominant, easy 
to use, and highly polymorphic markers. As diversity in 
Z tritici is unanimously considered high, a reduced 
panel of adapted SSRs should suffice to analyze popula- 
tion genetic structure [17]. Ideally, such markers should 
be genetically independent, i.e., reside on different chro- 
mosomes, and selectively neutral to avoid bias in subse- 
quent population genetic analyses [24]. Moreover, multiplex 
PCR, le., the simultaneous amplification of several markers 
within a single reaction, is a good way to improve the SSR 
genotyping efficiency and to considerably reduce cost and 
labour [25]. 

Here, we optimize and complete a method that enables 
the amplification of 24 highly informative SSRs from Z 
tritici, multiplexed into three independent panels. 

Findings 

Development of three efficient and multiplexed panels of 
SSRs for population studies 

To ensure relative independence of markers, and to de- 
tect the variably present accessory chromosomes, at least 
one SSR was chosen per chromosome. Therefore, the 9 
[20] and 99 [22] previously developed SSRs were mapped 
in silico by blast-searching their primers against the Z tri- 
tici genome at the JGI Genome Portal website (http://gen- 
ome.jgi-psf.org/) [5]. The chromosome position of 7 and 
73 SSRs was finally determined in each marker set, re- 
spectively (Figure 1). Blast of the primers from the 



remaining markers yielded non-specific or no result 
(Table 1). Interestingly, the position of 21 from the 23 
SSRs genetically mapped by Goodwin et al [22] was con- 
firmed while the last 2 could not be positioned in silico. 

Additionally, in order to increase the number of avail- 
able SSRs from which to choose markers to constitute 
the panels, microsatellites were mined from the fasta- 
formatted genome of strain IPO-323 using the same 
pipeline of Perl scripts and C++ program that has been 
used to obtain microsatellites from EST sequences of 
higher plants [26] . This pipeline output primer sequences, 
motifs and repeat counts of simple and compound micro- 
satellites, primer annealing temperatures, and primer loca- 
tions in the genome. It searched for motifs of one to four 
nucleotides, with respective minimum repeat counts of 
12, 6, 4, and 3. Up to 10 percent of mismatching positions 
were allowed, and the maximum distance between con- 
secutive repeat blocks was five nucleotides in compound 
microsatellites. Primers were chosen to amplify products 
100 to 500 nucleotides long, and to anneal at 60 ± 1.2°C, 
with preference to annealing temperatures in the interval 
60.0 ± 0.2°C. In addition to the assembled genome of IPO- 
323, 86,776,768 short (45-base) reads were available for 
strain IPO-94269 (unpublished data). Reads with suffi- 
ciently high quality scores were mapped to the genome of 
IPO-323 by blastn, and a Perl-collated consensus of these 
reads provided an approximate assembly of the genome of 
IPO-94269. IPO-94269 lacks the accessory chromosomes 
18 and 20, present in IPO-323 [5]. Otherwise, it was as- 
sumed for the sake of tractability that the two strains dif- 
fered only in substitutions, and not in indels. Another Perl 
script blast- aligned the microsatellite primers from above 
to the IPO-323 and IPO-94269 genomes, and selected the 
primer pairs that hit each genome exactly once. For chro- 
mosomes 18 and 20, we selected the primer pairs that hit 
the genome of IPO-323 exactly once. Together, these were 
the 3781 "specific" microsatellites that are expected to 
produce single products from both genomes. Details on 
these SSRs are available in Additional file 1. A subset of 
119 microsatellites having at least 50, 17, 10, or 8 repeats 
respectively for motifs of 1, 2, 3, or 4 nucleotides is also 
available in Additional file 2. 

Thereafter, markers were chosen according to (1) their 
chromosomal location, (2) their polymorphism, (3) their 
allele size range, (4) their specificity towards related spe- 
cies and (5) the annealing temperature of their primers. 
For those reasons, candidate markers were selected in pri- 
ority from the published sets because their polymorphism 
and species specificity had previously been assessed in a 
number of isolates [16,21,23]. SSR markers from core 
chromosomes were assembled in multiplexes 1 and 2 (8 
markers per multiplex), and SSR markers from accessory 
chromosomes were combined in multiplex 3. The forward 
primers were labeled with fluorescent dyes (blue, green or 
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Figure 1 Position of 80 published microsatellite loci on the genome of Zymoseptoria tritici (scale of the chromosomes is in base pairs) 

[20,22]. The graphical representation of the map was constructed using the software MAPCHART [33]. The microsatellites used in the three 
multiplexes are represented in larger font and in bold with their new name (correspondence to the previously published name can be found in 
Table 1). The colour of a microsatellite locus indicates in which multiplex the marker is integrated: red for multiplex 1, blue for multiplex 2 and 
green for multiplex 3. An asterisk indicates that the marker is reported for the first time. The positions of the housekeeping genes ILV5 (ID 102127) 
and SOD (ID 102956) are also indicated on chromosome 1. 



black from Sigma- Aldrich®) in a way that the same color 
was given only to markers with non-overlapping range of 
allele sizes. Candidate markers were individually tested on 
eight representative Z. tritici DNAs from diverse origins, 
and on DNAs from fungal species closely related to Z. tritici 
and of agronomical importance [27], including three Zymo- 
septoria passerinii, one Stagonospora nodorum and one 
Mycosphaerella fijiensis. When markers were clearly 
amplified, were specific to Z. tritici and were found 
polymorphic between isolates, they were assembled in 
multiplexes and tested on the same DNA samples. Ac- 
cording to the band patterns observed on high resolution 
agarose gel, multiplexes were optimized by means of 
marker exchange until all markers within a multiplex pro- 
duced the expected amplification pattern at the consensus 
annealing temperature of 57°C, with the exception of 
multiplex 3 for which null alleles were allowed. The 
marker ST1E7 [20] was included in multiplex 2 to allow 
comparison with previously published datasets [10]. Fi- 
nally, primers amplifying a 108 bp fragment from the 
housekeeping gene ILV5, keto-acid reductoisomerase 
(Chromosome 1, Protein ID 102127), and a 96 bp frag- 
ment from the housekeeping gene SOD, superoxide 
dismutase (Chromosome 1, Protein ID 102956), were 
introduced in the multiplexes to serve as amplification 



controls. This is particularly important for multiplex 3, 
since isolates without any accessory chromosome may 
be detected in field populations. 

Once the three panels were optimized on the limited 
number of samples, the procedure was automatized for 
high-throughput routine work. Fresh sporidia, collected 
from 5 day-old mono-pyenidia cultures on malt-yeast 
agar MYA (20 g.l" 1 malt extract, 5 g.l" 1 yeast extract and 
12.5 g.l" 1 agar), were distributed in 96-well plates of the 
following extraction kit and ground for 90 s at 50 rps 
with a tungsten bead in 50 ul lysis (AP-1) buffer (65°C) 
with a Retsch-MM300 grinder. DNA extraction was 
carried-out using the Qiagen DNA 'Biorobot 3000' ex- 
traction robot and the Qiagen DNeasy Plant Maxi Kit, 
following the manufacturers protocol. After extraction, 
DNAs were quantified on a Nanodrop spectrophotom- 
eter (Thermo Scientific®), adjusted to 15 ng.ul" 1 and 
stored at -20°C until use. 10 ul PCR reactions were per- 
formed using the Type-it microsatellite kit (Qiagen), spe- 
cifically designed for multiplexed PCRs. Each reaction 
contained 2 ul deionized water, 1 ul Q-solution, 1 ul of 
the primer mix (containing 2 uM of each SSR/house- 
keeping gene primers), 5 ul of the Type-it mix and 1 ul 
DNA. Panel composition is detailed in Table 1. PCR con- 
ditions were identical for all multiplexes, with preheating 



Table 1 Composition of three microsatellite PCR multiplexes developed for Zymoseptoria tritici population genetic studies 

Multiplex 3 Chrom. Locus b Locus in Repeat motif Primer sequences Fluores-cent Size Protein Na g Ne h Fna 1 Gene diversity (H £ ) J 

literature c of IPO-323 d (5'->3') dye e range ID f 



Multiplex 1 (core 1 ST1 GAC0002 (GAC) 6 

chromosomes) 



3 ST3A AG0003 (AG) 15 



4 ST4 AC0002 (AC) 7 



5 ST5 GGC0001 (GGC) 8 



9 ST9 CT0004 (CT) 6 



10 ST10 CAA0003 (CAA) 7 



Multiplex 2 (core 
chromosomes) 



F: TCCGTCATCAAC Blue 
AACACCAG 

R: TGGCCGTAGAA 
CTGCTGAG 

13 ST13 AG0009 (AG) 10 (GGCA) 3 F: GACTCCAmAC Blue 

CTGTGGCG 

R: TGTGAAGGACA 
CGCAAAGAG 

F: GTGTGAAACGA Blue 
AAAGCGGAG 

R: TACTGGGTATCG 
AGATCGGC 

F: TGCAGGGCA^ Black 
TAATOAGG 



11 ST11 GCA0002 (GCA) 6 



ST8 CT0007 (CT) 6 



(bp) 

192-222 33894 3 



Total pop. Total pop. Total pop. Local pop. Worldwide pop. 
(n=166) (n=166) (n=166) (n = 33) (n=133) 



F: AATCGACCCOT Black 
COTCAAC 

R: GGGGGAGAGGC 
ATAGTC1TG 

F:AOTGGGGAGGT Black 226-258 103950 16 5.8 

GTOTGAG 

R: ACGAATOTOA 
TOCAGCG 



F: TGAACATCAACC Green 182-206 71387 7 

TCACACGC 

R: AGAAGAGGACG 
ACCCACGAG 

F: GATACCAAGGT Green 232-256 104484 6 

GGCCAAGG 

R: CACGTOGGAG 
TGTCGAAG 

F: CACCTCACTCCT Green 336-348 62567 6 

CAATOCG 

R: GAAAGGTOGT 
GTCGTGTCC 



139-160 62856 



1.4 0.6% 0.305 0.308 



2.4% 0.785 0.825 



2.0 0.0% 0.479 0.504 



1.4 0.0% 0.114 0.345 



1.7 



0.404 0.413 



2.7 0% 0.624 0.631 



192-200 106529 5 2.4 0% 0.566 0.587 



293-296 - 2 1.1 17.5% 0.071 0.087 



180-188 61457 5 1.6 40.4% 0.322 0.404 



R: TCCATCCAmA 
GGCTCGTC 



Table 1 Composition of three microsatellite PCR multiplexes developed for Zymoseptoria tritici population genetic studies (Continued) 



12 ST12 TCC0002 (TCC) 6 



3 ST3B CAA0005 (CAA) 8 



6 ST6 TCC0009 (TCC) 8 



3 ST3C GCA0003 (GCA) 7 



2 ST2 Chr02-140 (GCT) 12 



11 ST1E7 ST1E7 (CGG) 3 



7 ST7 AC0001 (AC) 2 



Multiplex 3 15 

(accessory 

chromosomes) 



ST15 AC0007 (AC) 1: 



21 ST21 GGA0001 (GGA) 6 



18 ST18 Chr1 8-007 (GGA) 5 



F: GAATCCACCTCT 
TCOTGCC 

R: AGGAGGATATC 
AAGGCCCAG 

F: AAGAATCCCAC 
CACCCAAAC 

R: CACACGGCTCC 
mGACAC 

F: TCAATOCCAAT 
AATOGGG 

R: AGACGAGGCAG 
TTGGTTGAG 

F: TCCTATCAACTC 
CCGAGACG 

R: CCGCACGTAGG 
AA^CAG 

F: ACACCAAAGAA 
GGATCCACG 

R: GCCGGAGGTCT 
ATCAGmG 

F: GATCTCGAGCA 
GGGCGGAAGT 

R: TCACACGCTGGT 
CTGTGAATC 

F: CACCACACCGT 
CGTOAAG 

R: CGTAAGTOGTG 
GAGATGGG 

F: TGCTCGCAAGAC 
ATAAAACG 

R: CTCTOGCATO 
GTCGGTGG 

F: GTACGACACGG 
GCTATGGAG 

R: GGCGATGACGA 
TGAAACC 

F: AGAGGCAATGG 
TGGGTGAT 



Black 226-232 111480 3 1.1 5.4% 0.059 0.137 



Black 263-: 



Blue 



10 2.9 0.6% 0.663 O.t 



Green 161-179 73474 6 1.5 3.6% 0.242 0.342 



Green 229-253 108534 5 2.2 1.2% 0.580 0.533 



Green 338-365 108062 9 2.9 



0.439 0.6 



1.5 20.0% 0.204 0.373 



Blue 171-227 105049 15 1.9 0.6% 0.463 0.487 



Black 92-152 106622 15 8.0 16.9% 0.858 0.£ 



Black 149-161 111806 5 1.8 32.5% 0.632 0.640 



Black 245-272 111768 2 1.2 71.1% 0.411 0.430 



Table 1 Composition of three microsatellite PCR multiplexes developed for Zymoseptoria tritici population genetic studies (Continued) 



19 ST19 Chr1 9-031 (TCG) 6 



20 ST20 Chr20-066 (GAA) 22 



16 ST16 Chr16-016 (GAA) 6 



17 ST17 Chr1 7-040 (AGA) 5 



14 



ST14 Chr1 4-023 (TCC) 20 (TCG) 5 



R: TCCTCCTCGTCC 
GACAATAC 

F: CTACTGTAmC Black 
CCGGGGGT 

R: ACCCCTCTCTCC 
TOTCTCG 

F: CTCCTCCACATC Green 
CATCCAAC 

R: GACGGAGGGGG 
AGAGGTAT 

F: TTCACGGTATCA Green 
CAGACCCA 

R: GGGTOTCCAAG 
CJGTTGTT 

F: ACAAGAGGCGG Blue 
AAGACTGAA 

R: TCATGCGTCGTA 
TOTOGA 

F: CAAACAGCCAGA Blue 
TCCACC^ 

R: GTCGTGGTCGGA 
GAGAGAAG 



351-363 97944 5 



229-: 



19 



496-538 97708 7 



236-239 97821 2 



361-529 111734 36 



2.3 12.6% 0.700 0.631 



13.9 10.8% 0.891 0.914 



1.2 13.3% 0.369 0.364 



1.0 18.7% 0,367 0.311 



12.1 



13.8% 



0.898 



0.920 



a Primers for the housekeeping gene SOD are added in multiplexes 1 and 3 (green fluorescent dye; 96 bp amplicon; F:CTACCGATGCTCCAGCTGAGA and R:CACCACCCTTGCCATCCA), and primers from the housekeeping 

gene ILV5 are added in multiplexes 1 and 2 (black fluorescent dye; 108 bp amplicon; F:GCTCTATCGCCATCTTCCAGG and R:TCCTTCTCG A AG GTG GTCTTGT) . 

b Proposed name of microsatellite loci starting with the fungus initials and followed by the chromosome number on which the microsatellite is located (except for ST1E7). 

c Loci name found in the literature [20,22] or in the Additional file 1 (primers for Chr1 8-007, Chr20-066 and Chr1 7-040 have been modified from the original design to fit the multiplex range size requirements). 
d Repeat motif of IPO-323 as published in [20,22], with exception of ST7 and ST13, for which the motif was found slightly different after sequencing. 

e Primers were purchased from Sigma Aldrich (France). They are WellRed oligos, licensed by Beckman Coulter (USA), and bearing fluorochromes at their 5' end: D1: red, D2: black, D3: green, D4: blue, 
identity number of the protein corresponding to SSR loci identified within expressed sequences (Filtered Gene Models from Mgraminicolav2 at JGI). 
g Number of alleles identified in the total population (excluding null alleles). 
Effective number of alleles (excluding null alleles). 

'Frequency of missing data or null alleles (no amplification) in the total population. 

J Unbiased gene diversity H E calculated as the probability that two individuals taken at random have different genotypes (excluding null alleles) [31]. 
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at 95°C for 5 mn, followed by 30 cycles of 95°C for 30 s, 
57°C for 90 s and 72°C for 30 s, with a final extension step 
of 60°C for 30 mn, using a MJ research PTC-200 thermo- 
cycler. The PCR products were run on a 2% agarose gel to 
check for amplification products. Sequencing reactions 
were prepared as follows: for multiplexes 1 and 2, 0.8 \A 
PCR product, 40 \A Sample Loading Solution (SLS) and 
0.5 \A 400 bp size standard (red dye); the mix for multiplex 
3 was the same, with the exception that 0.5 \A 600 bp size 
standard was used instead of the previous one. Fragments 
were separated using a Beckman Coulter CEQ-8000 DNA 
Analyzer. Results were visualized using the CEQ-8000 
Genetic Analysis System Software (Beckman Coulter). 

Automation of allele scoring 

We automated the allele scoring procedure using the 
binning analysis function of the CEQ-8000 Genetic Ana- 
lysis System Software (Beckman Coulter) on the 166 geno- 
typed samples (see below for description of populations). 
Such automation helps on one hand to solve problems re- 
lated to the size assignment of ambiguous peaks produced 
on the chromatogram by Taq polymerase stuttering or 
fluorescent dyes reflection, and provides on the other 
hand a reproducible scoring method across readers and la- 
boratories. In short, the software first provides for each 
marker a distribution of peak height and approximate size 
relative to the size-standard ladder. Each marker was then 
manually delimited ('binned') for each allele according to 
its size variability and a minimal peak height, in order to 
eliminate false positive or aberrant peaks and to minimize 
bin width. Binned alleles were finally assigned a definitive 
name (generally derived from the mean allele size) and al- 
lele size ranges were established (Table 1). The generated 
binning files could be used to annotate allele peaks from 
new data in any laboratory using a Beckman Coulter DNA 
Analyzer, and are described in Additional file 3 and pro- 
vided as Additional file 4 or upon request to the authors. 
The alleles' sizes of IPO-323 and other 100 reference 
isolates are provided as examples of this procedure 
(Additional file 3) to help further assignment by other 
operators. Resequencing data of all these reference isolates 
will be soon released on the JGI Genome Portal website 
(http://genome.jgi-psf.org/). This procedure assures reli- 
able and accurate allele scoring, even when different or in- 
experienced operators manipulate the data, increasing the 
reproducibility between distinct datasets generated over 
time. This procedure can also be progressively improved, 
since new alleles of the same markers and also completely 
new markers can be implemented in the "binning files". 

Checking the quality of the multiplexed SSR panels 

Frequency parameters were calculated for the 166 indi- 
viduals (see below for the description of the populations) 
with the software GENALEX 6.4 [28]. For each marker, 



the number of SSR repeats present in the isolate IPO- 
323 was determined according to its genome sequence 
and used to infer the number of repeats from all the 
other alleles. In a few cases, fewer than 0 repeats were 
deduced for alleles present in never more than one to 
three individuals; such alleles were coded as missing al- 
leles and were not considered further. Excluding those, a 
total of 204 alleles were detected. The number of alleles 
per SSR ranged from 2 to 16 for panels 1 and 2 on the 
core chromosomes (Table 1). Markers on the accessory 
chromosomes 14 and 20 had particularly numerous alleles 
with 36 and 20 detected alleles, respectively. The 3 markers 
ST11 in panel 1, and ST8 and ST1E7 in panel 2 could not 
be amplified in a large number of isolates probably because 
of the presence of null alleles (Table 1). The same observa- 
tion is true for all markers from panel 3 on the accessory 
chromosomes with 11 to 71% null amplifications, on chro- 
mosomes 20 and 18, respectively. This result is supported 
by other studies, which revealed that chromosome 18 is 
the accessory chromosome the most often missing in field 
isolates of Z. tritici [7] . We also sequenced the PCR prod- 
uct of ST 18 from 3 isolates carrying the 245 bp allele 
(IPO-02166, IPO-98032, IPO-95054) and 2 isolates carry- 
ing the 272 bp allele (IPO-98038, IPO-95052) (Table 1) 
and found that this marker was not polymorphic at its 
microsatellite motif (GGA) 5 . The allele size difference at 
ST 18 is caused by an insertion of 27 bp in the larger allele. 

The eight accessory chromosomes exhibit strong differ- 
entiation between strains, in number but also in length of 
chromosomes [7]. Indeed, the frequency in presence or 
absence of chromosomal segments varies strongly along 
each accessory chromosome. For example, chromosome 
14 is present in more than 80% of the strains while it har- 
bors a substantial insertion present in less than 20% of 
these same strains [7]. Except for chromosome 18, the 
SSR markers from panel 3 on accessory chromosomes are 
in conserved segments of their respective chromosome, 
which make them reliable indicators of the presence or 
absence of each accessory chromosome. The presence or 
absence of chromosome 18 is particularly difficult to pre- 
dict due to its high differentiation between strains from 
different geographical origins [7]. 

Non-neutral loci may bias population genetic studies 
[24]. To identify possible outlier loci in the panels, the 
24 SSR markers constituting the three multiplex panels 
were subjected to analysis with the selection detection 
workbench LOSITAN [29]. LOSITAN detects outlier loci 
from the expected distribution of Wright's inbreeding 
coefficient F st versus the expected heterozygozity (or gene 
diversity) H e under an island model of migration with neu- 
tral markers. A neutral mean F st was computed after ex- 
cluding loci for which selection had been detected initially. 
The mean observed F st value was 0.005, while individual 
marker F st values ranged from -0.036 to 0.061. Only one 
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under positive selection; in grey, the confidence area for neutral loci; in yellow, the confidence area for candidate loci potentially under balancing 
selection. The names of the loci are indicated close to their position on the chart. 



out of the 24 SSR markers, ST 18, was declared an outlier, 
since it appeared in the confidence area for loci potentially 
under balancing selection (Figure 2). Either directional se- 
lection or genotyping errors can generate outlier effects. 
The very high proportion (71%) of null alleles for ST 18 is 
the most likely explanation for this marker being detected 
as an outlier. The other markers are considered as neutral. 

Validation of the multiplexed SSR panels: diversity 
measurement in two populations 

To validate the SSR multiplexing procedure, the auto- 
mated allele assignment, and the value of the selected 



SSR markers for population genetic studies, two distinct 
fungal populations were genotyped following the pro- 
posed protocol. Isolates belonging to a first, global, 
population (n = 133) were collected worldwide over 
25 years; they covered 28 countries from the five conti- 
nents, including different areas in France, and were re- 
trieved from laboratory collections. Isolates belonging 
to a second, local, population (n = 33) were collected dur- 
ing the same crop season 2009-2010 in a single experi- 
mental field plot in Grignon, France (48°51'N, 1°58'E; 
[30]). Samples were genotyped and analyzed as previously 
described. 




Figure 3 Clonal richness and gene diversity measured in local and worldwide Zymoseptoria tritici populations according to the number 

of SSRs used in the analysis 3 . a Clonal richness, measured as the number of unique genotypes compared to the number of isolates (G/N) is 

represented with grey squares. Gene diversity, estimated as the mean expected heterozygosity over loci [31] is represented with black diamonds. 

Both indexes are calculated for each multiplex and for all of them according to the number of SSR markers involved in the calculation. 
\ . J 
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For further index calculations, missing data were coded 
as missing alleles for multiplexes 1 and 2, and as null al- 
leles for multiplex 3. Gene diversity, estimated as the 
mean expected heterozygoty H e over the loci [31], and 
clonal richness (G/N; number of unique genotypes 
compared to the number of individuals) were plotted 
against the number of loci used to calculate these indexes 
in the three multiplexes and for all of them (Figure 3), 
using the MULTILOCUS program [32]. As expected, gene 
diversity was similarly high in the two populations, and 
exceeded 0.95 after 4 loci were included in calculation for 
multiplexes 1 and 2, and after two loci were included for 
multiplex 3, probably because null alleles are considered 
as distinct alleles for markers located on the dispensome. 
The 24 SSR loci present in the three panels are then suffi- 
cient to saturate the whole gene diversity, at a local and a 
larger scale. The observed ranges for gene diversity, for 
single loci (Table 1) or for genotypes (Figure 3), are similar 
or even greater compared to what has been reported in 
the literature for various Z. tritici populations, either with 
SSR, RAPD, AFLP or RFLP markers [9-13,18]. The 24 
SSR loci are also sufficient to detect possible clonal indi- 
viduals as only 11 and 23 markers were necessary to dis- 
tinguish all haplotypes in the local and world populations, 
respectively (Figure 3); in practice 164 haplotypes were 
recognized among the 166 isolates. Such a low proportion 
of clonal individuals (G/N) corroborates several studies 
pointing out the importance of sexual recombination in 
field populations of Z tritici [9,13,17,18,21]. 

Conclusion 

The proposed multiplex PCR genotyping method, together 
with the automatic allele assignment procedure, constitutes 
a reliable tool for population genetics studies on Z tritici. 
Moreover, these ready-to-use methods enable important 
savings in terms of money, time, reproducibility and accur- 
acy, and produce results in agreement with others previ- 
ously published about Z tritici populations. As the chosen 
SSRs are highly polymorphic and describe most of the 
population diversity, the three multiplex panels could be 
used "a la carte" to address specific questions. For example, 
the use of a single panel (either multiplex 1 or 2) could suf- 
fice to describe diversity in local/reduced populations, 
whereas both panels representing the core chromosomes 
(multiplexes 1 and 2 together) would be useful to study lar- 
ger unknown populations. Multiplex 3 is of great interest 
to acquire complementary information on the dispensome 
dynamics and evolution [7] of the studied populations, but 
users should keep in mind that false negative results are al- 
ways possible due to the variation in chromosomal content 
between strains. Multiplex 3 should also be used with care 
for population genetics analyses, as the great number of 
missing alleles may bias the results. The three markers with 
null alleles from the multiplexes 1 and 2 (i.e. ST11, ST8, 



ST1E7) may be as well omitted from certain analyses for 
the same reason. These sets of optimized markers will be 
of great relevance to the Z tritici scientific community, as 
they provide a method that can be applied in any labora- 
tory and will allow facile comparison of studies from vari- 
ous locations, scales, and with different aims. The method 
is currently being used successfully in two laboratories to 
genotype different European populations consisting of sev- 
eral thousand individuals. 

Additional files 



Additional file 1: Characteristics of 3781 "specific" microsatellites 
identified in silico from Zymoseptoria tritici genome. Primer pairs are 
proposed for each microsatellite, which hit each genome of strains IPO-323 
and IPO-94269 exactly once. 

Additional file 2: Subset of 119 microsatellites with highly repeated 
motifs. 

Additional file 3: Alleles computed in the automated allele scoring 
procedure and frequency of alleles recorded among 166 individual 
strains. Strains were selected from a local French population and from a 
worldwide collection.The multilocus genotypes of 101 strains is provided 
as reference. 

Additional file 4: Binning files, to be used for automatic allele scoring. 

Files to use with the CEQ-8000 Genetic Analysis System Software (Beckman 
Coulter) to provide automatic assignment of alleles for each SSR marker. 
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